Analysis of evidence-theoretic decision rules for pattern classification
نویسنده
چکیده
The Dempster-Shafer (D-S) theory of evidence provides a convenient framework for modeling uncertainty in situations where the available evidence is limited or weak. Such situations typically arise in supervised pattern recognition when the training set is small or does not contain samples from all classes. In D-S theory, someone's feeling of uncertainty concerning a certain set of hypotheses is represented by a belief function, that generalizes the classical concept of probability distribution. Recently, several classiication methods have been proposed for determining a posterior belief function over a set of classes, given a pattern x to be classiied. However, the problem of making decisions based on a belief function and an arbitrary loss matrix is not simple, since conventional Bayes decision analysis assumes uncertainty to be quantiied by a probability distribution. This paper provides detailed analysis of several decision rules that can be based on a posterior belief function and a quantiication of the consequences of each action (assignment to a class or rejection). Diierent decision strategies are proposed, based on the concepts of upper, lower and pignistic expectation that generalize the usual concept of mathematical expectation underlying Bayes decision theory. The decision rules and the corresponding decision regions in feature space are analyzed under the assumptions of completeness and incompleteness of the training set. As expected, patterns situated close to class boundaries tend to be rejected according to the three decision rules studied. Additionally, the more conservative strategies of upper and pignistic expected loss minimization also lead to the rejection of \atypical" patterns situated far from training samples in feature space. When the training set is not complete, the available options are assignment to one of the known classes, assignment to the unknown class, and rejection. By imposing some restrictions on the loss matrix, it is shown that various conngurations of the decision regions can be induced by varying only two parameters corresponding to the costs of rejection and misclassiication in the unknown class, respectively. The analysis is illustrated using real data from a river quality monitoring application. Abstract The Dempster-Shafer theory provides a convenient framework for decision making based on very limited or weak information. Such situations typically arise in pattern recognition problems when patterns have to be classiied based on a small number of training vectors, or when the training set does not contain samples from all classes. This paper examines diierent strategies that can be applied in …
منابع مشابه
An Integrated DEA and Data Mining Approach for Performance Assessment
This paper presents a data envelopment analysis (DEA) model combined with Bootstrapping to assess performance of one of the Data mining Algorithms. We applied a two-step process for performance productivity analysis of insurance branches within a case study. First, using a DEA model, the study analyzes the productivity of eighteen decision-making units (DMUs). Using a Malmquist index, DEA deter...
متن کاملRough sets theory in site selection decision making for water reservoirs
Rough Sets theory is a mathematical approach for analysis of a vague description of objects presented by a well-known mathematician, Pawlak (1982, 1991). This paper explores the use of Rough Sets theory in site location investigation of buried concrete water reservoirs. Making an appropriate decision in site location can always avoid unnecessary expensive costs which is very important in constr...
متن کاملارائه مدلی برای پیشبینی نوع صافی همودیالیز با تکنیکهای دادهکاوی
Introduction: Inadequate dialysis for patients' kidneys as a mortality risk necessitates the presence of a pattern to assist staff in dialysate part to provide the proper services for dialysis patients and also the proper management of their treatment. Since the role of buffer type in the adequacy of dialysis is determinative, the present study is aimed at determining hemodialysis buffer type. ...
متن کاملUSING DISTRIBUTION OF DATA TO ENHANCE PERFORMANCE OF FUZZY CLASSIFICATION SYSTEMS
This paper considers the automatic design of fuzzy rule-basedclassification systems based on labeled data. The classification performance andinterpretability are of major importance in these systems. In this paper, weutilize the distribution of training patterns in decision subspace of each fuzzyrule to improve its initially assigned certainty grade (i.e. rule weight). Ourapproach uses a punish...
متن کاملAn Information-Theoretic Approach to the Pre-pruning of Classification Rules
The automatic induction of classification rules from examples is an important technique used in data mining. One of the problems encountered is the overfitting of rules to training data. In some cases this can lead to an excessively large number of rules, many of which have very little predictive value for unseen data. This paper is concerned with the reduction of overfitting. It introduces a t...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Pattern Recognition
دوره 30 شماره
صفحات -
تاریخ انتشار 1997